03. Quiz: MC Control Methods
In this lesson, we'll work with a simple gridworld example to illustrate the main ideas. The gridworld is identical to the environment that we examined when learning about Monte Carlo (MC) methods. Please watch the next video to refresh your memory.
## Video
Gridworld Example
Before learning about Temporal-Difference control methods, check your knowledge of constant-$\alpha$ MC control by watching the video below.
## Video
Quiz: MC Control Methods
## Quiz
Consider an agent learning to navigate the gridworld described in the videos above. Suppose the agent uses constant-$\alpha$ MC control in its search for the optimal policy, with $\alpha = 0.1$. At the end of the 99th episode, the Q-table has the following values:
[Figure: Q-table]
Suppose the 100th episode is the one shown below.
[Figure: 100th episode]
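To work through a question like this, recall that constant-$\alpha$ MC control updates each visited state-action pair toward the return $G_t$ that followed it: $Q(S_t, A_t) \leftarrow Q(S_t, A_t) + \alpha \, (G_t - Q(S_t, A_t))$. The Python sketch below illustrates that update under stated assumptions: it uses the every-visit variant, a discount rate `gamma` that defaults to 1, and a dictionary-based Q-table. The function names, state and action labels, and all numeric values are hypothetical and are not taken from the quiz's Q-table or episode.

```python
def episode_returns(rewards, gamma=1.0):
    """Return G_t for every time step t, given the rewards R_{t+1} in order."""
    returns = []
    G = 0.0
    # Accumulate from the end of the episode: G_t = R_{t+1} + gamma * G_{t+1}
    for r in reversed(rewards):
        G = r + gamma * G
        returns.append(G)
    returns.reverse()
    return returns


def constant_alpha_mc_update(Q, episode, alpha, gamma=1.0):
    """Every-visit constant-alpha MC update of Q from one finished episode.

    Q       : dict mapping (state, action) -> estimated action value
    episode : list of (state, action, reward) tuples, where reward is the
              reward received after taking `action` in `state`
    """
    rewards = [reward for _, _, reward in episode]
    for (state, action, _), G in zip(episode, episode_returns(rewards, gamma)):
        old = Q.get((state, action), 0.0)  # unseen pairs start at 0 (assumption)
        Q[(state, action)] = old + alpha * (G - old)
    return Q


# Hypothetical example (values are NOT the quiz's Q-table or episode).
Q = {("s1", "right"): 6.0, ("s2", "up"): 8.0}
episode = [("s1", "right", -1.0), ("s2", "up", 10.0)]
constant_alpha_mc_update(Q, episode, alpha=0.1)
print(Q)
```

In this hypothetical episode the returns are 9 and 10, so with $\alpha = 0.1$ the two entries move one tenth of the way toward them: from 6 to about 6.3 and from 8 to about 8.2. Entries for pairs that were not visited in the episode are left unchanged.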